Speech Recognition in the Automobile

نویسندگان

  • CARNEGIE MELLON
  • Nobutoshi Hanai
چکیده

Acknowledgments Chapter 1: Introduction Chapter 2: The SPHINX Speech Recognition System 1 2 3 5 2.1 Signal Processing ............................ 5 2.2 Clustering and Vector Quantization ..................... 6 2.3 Hidden Markov Models .......................... 7 2.4 Speech Unit ............................... 7 Chapter 3: The Motorola Car Database and AN4 Database 8 3.1 The Motorola Car Database ........................ 8 3.2 The AN4 Database ............................ 9 3.3 Summary ................................ 9 Chapter 4: Noise Characteristics in the Automobile 11 4.1 Noise Sources ............................. 11 4.1.1 Running Noise .......................... 11 4.1.2 Functional Noise ......................... 15 4.1.3 Outer Noise ........................... 17 4.2 Summary ............................... 17 Chapter 5: Speech Recognition in Adverse Environments: Previous Work 18 5.1 Auditory-Based Front Ends ....................... 18 5.2 Noise and Noise-Word Models ...................... 18 5.3 Cepstral Mean Normalization and the RASTA Method ............. 19 5.4 The CDCN Algorithm ......................... 19 5.5 Speech Recognition in the Car Environment ................. 22 5.6 Summary ............................... 23 Chapter 6: Recognition in the Motorola Car Database Task 24 6.1 Baseline System ............................ 24 6.2 Mel-Frequency Cepstral Coefficients .................... 25 6.3 Environmental Compensation Algorithms .................. 26 6.3.1 Cepstral Mean Normalization .................... 26 6.3.2 CDCN ............................. 27 6.3.3 Combination of Cepstral Mean Normalization and CDCN ......... 28 6.4 Histogram-based CDCN ......................... 29 6.5 Summary ............................... 31 Chapter 7: Noise Cancellation for Car Radio 32 7.1 Collection of Stereo Data ........................ 32 7.2 Adaptive Noise Cancellation ....................... 32 7.3 Recognition Results .......................... 34 7.4 Summary ............................. 35 Chapter 8: Conclusions and Suggestions for Future Work 36 8.1 Conclusions .............................. 36 8.2 Suggestions for Future Work ....................... 37 References 39

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Statistical Variation Analysis of Formant and Pitch Frequencies in Anger and Happiness Emotional Sentences in Farsi Language

Setup of an emotion recognition or emotional speech recognition system is directly related to how emotion changes the speech features. In this research, the influence of emotion on the anger and happiness was evaluated and the results were compared with the neutral speech. So the pitch frequency and the first three formant frequencies were used. The experimental results showed that there are lo...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Towards robust telephony speech recognition in office and automobile environments

This study is concerned with improving the robustness of our telephony speech recognition system. Our previous implementation of this system handled both landline and cellular speech produced in a relatively quiet environment, such as in a regular o ce. However, it was found to be unduly vulnerable to background noise. In particular, we wanted to improve the accuracy of the system in the enviro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006